CURLER: Finding and Visualizing Nonlinear Correlation Clusters
نویسندگان
چکیده
While much work has been done in finding linear correlation among subsets of features in high-dimensional data, work on detecting nonlinear correlation has been left largely untouched. In this paper, we present an algorithm for finding and visualizing nonlinear correlation clusters in the subspace of high-dimensional databases. Unlike the detection of linear correlation in which clusters are of unique orientations, finding nonlinear correlation clusters of varying orientations requires merging clusters of possibly very different orientations. Combined with the fact that spatial proximity must be judged based on a subset of features that are not originally known, deciding which clusters to be merged during the clustering process becomes a challenge. To avoid this problem, we propose a novel concept called co-sharing level which captures both spatial proximity and cluster orientation when judging similarity between clusters. Based on this concept, we develop an algorithm which not only detects nonlinear correlation clusters but also provides a way to visualize them. Experiments on both synthetic and real-life datasets are done to show the effectiveness of our method.
منابع مشابه
Visualizing the Clusters and Dynamics of HPV Research Area
Purpose: The purpose of the present study is to visualize HPV clusters’ relationships and thematic trends in the world. Methodology: The research type is an applied one with analytical approach and it has been done using co-word analysis. The population of this study consists of articles’ keywords indexed during 2014-2018 in the Web of Science (WoS) in HPV subject area. The total numbers of th...
متن کاملWhy do Bimetallic Clusters have more Chemical Reactivity? Study the VnNim (2 ≤ n + m ≤ 6) Clusters as the Nano Species
This article gives you proof that bimetallic transition metal clusters with the difference in electronegativity are better catalysts than monoatomic one. To prove this fact, a study of ethylene adsorption on bimetallic clusters vanadium-nickel VnNim (2≤n+m≤6) has been demonstrated. Our result shows that hardness has a quite good linear correlation with the non-Lewis of VnNi (n=1-5) cluster (R2=...
متن کاملFinding and Visualizing Subspace Clusters of High Dimensional Dataset Using Advanced Star Coordinates
Analysis of high dimensional data is a research area since many years. Analysts can detect similarity of data points within a cluster. Subspace clustering detects useful dimensions in clustering high dimensional dataset. Visualization allows a better insight of subspace clusters. However, displaying such high dimensional database clusters on the 2-dimensional display is a challenging task. We p...
متن کاملFinding and Visualizing Graph Clusters Using PageRank Optimization
We give algorithms for finding graph clusters and drawing graphs, highlighting local community structure within the context of a larger network. For a given graph G, we use the personalized PageRank vectors to determine a set of clusters, by optimizing the jumping parameter α subject to several cluster variance measures in order to capture the graph structure according to PageRank. We then give...
متن کاملVisualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database
Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005